Shadow Theory, data model design for data integration
Abstract
In information ecosystems, semantic heterogeneity is known as the root cause of the difficulties of data integration, and the Relational Model was not designed to address such challenges, i.e., to reuse data modeled by other sources in the local data model design. Although researchers have proposed many different approaches and software vendors have designed tools to help with the data integration task, it remains an art that relies on human labor. Owing to the lack of a comprehensive theory to guide the overall modeling process, the quality of the integrated data depends heavily on the data integrators' experience. Based on observations of practical issues in enterprise customer data integration, we believe that the needed solution is a new data model. This new data model needs to manage the difficulties of semantic heterogeneity: different data collected from different perspectives about the same subject matter can naturally be inconsistent or even in conflict. In other words, existing data models are designed to support a single version of the truth, and they naturally run into difficulties during data integration when data collected from different sources are based on different perspectives or sit at different levels of abstraction. Therefore, we propose Shadow Theory to serve as the philosophical foundation for designing a new data model. The kernel of the theory is the notion of shadow, which can be traced back to Plato's Allegory of the Cave over 2000 years ago. The basic idea is that whatever we can observe and store in databases about the subject matter are just shadows. The meanings of shadows are mental entities that exist only in the viewers' cognitive structures. Such mental entities are constrained by the viewers' internal model of reality, especially the implicitly or explicitly chosen perspective(s), or ontology if formally represented. In this paper, we propose six basic principles to guide the overall data model design.
Further, we propose an algebra with a set of basic operators to support data operations by their meanings, not by their logical structure. The representation is based on pointfree geometry, such that any meaning is represented as an area in semantic space, which can be decomposed or aggregated in different ways concurrently. We attach W(hat)-tags to shadows to record their meanings, and use E(quivalence)-tags to recognize which meanings can be treated as the same. We use enterprise customer data integration as an example to illustrate the data model design and operation principles.
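To make the tagging idea concrete, the following is a minimal sketch of how W-tags and E-tags might interact in a customer data setting. It is not the paper's algebra: the abstract does not specify the operators or the pointfree-geometry representation, so everything here is an assumption made for illustration. The names `Shadow`, `ETagRegistry`, and `select_by_meaning`, and the tag strings such as `crm:account_name`, are hypothetical. W-tags are modeled as plain strings, and E-tags as a union-find structure that groups W-tags declared to carry the same meaning.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Shadow:
    """A stored observation; its meaning is carried by its W-tags (hypothetical model)."""
    source: str
    value: str
    w_tags: frozenset  # W(hat)-tags: what this shadow means from one perspective


class ETagRegistry:
    """E(quivalence)-tags, modeled here (as an assumption) by union-find over W-tag names."""

    def __init__(self):
        self._parent = {}

    def _find(self, tag):
        # Register unseen tags as their own representative.
        self._parent.setdefault(tag, tag)
        while self._parent[tag] != tag:
            self._parent[tag] = self._parent[self._parent[tag]]  # path halving
            tag = self._parent[tag]
        return tag

    def declare_equivalent(self, a, b):
        """Record that W-tags a and b can be treated as the same meaning."""
        self._parent[self._find(a)] = self._find(b)

    def same_meaning(self, a, b):
        return self._find(a) == self._find(b)


def select_by_meaning(shadows, w_tag, registry):
    """Return shadows whose meaning matches w_tag, regardless of source schema."""
    return [
        s for s in shadows
        if any(registry.same_meaning(t, w_tag) for t in s.w_tags)
    ]


# Two sources describe the same customer from different perspectives.
crm = Shadow("crm", "Acme Corp.", frozenset({"crm:account_name"}))
billing = Shadow("billing", "ACME Corporation", frozenset({"billing:payer"}))
shipping = Shadow("shipping", "Dock 4", frozenset({"shipping:dock"}))

registry = ETagRegistry()
registry.declare_equivalent("crm:account_name", "billing:payer")

matches = select_by_meaning([crm, billing, shipping], "billing:payer", registry)
sources = [s.source for s in matches]
```

In this sketch the query is phrased in terms of one source's W-tag, yet it also retrieves the shadow from the other source, because the selection follows the declared equivalence rather than the column or table structure. Both conflicting values are kept side by side instead of being forced into a single version of the truth.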
Journal: CoRR
Volume: abs/1209.2647
Publication date: 2012